Binaural Rendering in MPEG Surround

نویسندگان

  • Jeroen Breebaart
  • Lars F. Villemoes
  • Kristofer Kjörling
چکیده

This paper describes novel methods for evoking a multichannel audio experience over stereo headphones. In contrast to the conventional convolution-based approach where, for example, five input channels are filtered using ten head-related transfer functions, the current approach is based on a parametric representation of the multichannel signal, along with either a parametric representation of the head-related transfer functions or a reduced set of head-related transfer functions. An audio scene with multiple virtual sound sources is represented by a mono or a stereo downmix signal of all sound source signals, accompanied by certain statistical (spatial) properties. These statistical properties of the sound sources are either combined with statistical properties of head-related transfer functions to estimate “binaural parameters” that represent the perceptually relevant aspects of the auditory scene or used to create a limited set of combined head-related transfer functions that can be applied directly on the downmix signal. Subsequently, a binaural rendering stage reinstates the statistical properties of the sound sources by applying the estimated binaural parameters or the reduced set of combined head-related transfer functions directly on the downmix. If combined with parametric multichannel audio coders such as MPEG Surround, the proposed methods are advantageous over conventional methods in terms of perceived quality and computational complexity.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-channel Goes Mobile: Mpeg Surround Binaural Rendering

1 Philips Research Laboratories, 5656 AA, Eindhoven, The Netherlands [email protected] 2 Fraunhofer Institute for Integrated Circuits IIS, 91058 Erlangen, Germany {hrr;pts}@iis.fraunhofer.de 3 Coding Technologies, 11352 Stockholm, Sweden {lv;kk}@codingtechnologies.com 4 Vast Audio, NSW 1430 Sydney, Australia [email protected] 5 Philips Applied Technologies, 5616 LW Eindhoven, The ...

متن کامل

The SoundScape Renderer: A Unified Spatial Audio Reproduction Framework for Arbitrary Rendering Methods

The SoundScape Renderer is a versatile software framework for real-time spatial audio rendering. The modular system architecture allows the use of arbitrary rendering methods. Three rendering modules are currently implemented: Wave Field Synthesis, Vector Base Amplitude Panning and Binaural Rendering. After a description of the software architecture, the implementation of the available renderin...

متن کامل

Binaural Cue Coding: Rendering of Sources Mixed into Amono Signal

This paper reviews Binaural Cue Coding (BCC). BCC is a lossy technique for either reducing either a number of source signals or a multichannel audio signal to one audio channel plus side information. In the case when a number of source signals (e.g. separately recorded instruments) are reduced to one audio channel plus side information, the BCC synthesis allows rendering of each source as if th...

متن کامل

Abstract: Person Tracking Sensor Based Multi-Channel Audio Panning for Multi-View Broadcasting Services

In this paper, a person tracking sensor based multi-channel audio panning approach is proposed for multi-view broadcasting services. Multi-view broadcasting is realized by rendering the video sequences captured by a set of cameras from different viewpoints. In addition, a multi-channel audio panning technique is required for realistic audio rendering. Moreover, person-tracking techniques for es...

متن کامل

Spatial Audio with the SoundScape Renderer

The SoundScape Renderer (SSR) is a versatile tool for realtime spatial audio reproduction, implementing a variety of headphoneand loudspeaker-based methods. Among others this includes Wave Field Synthesis, Higher Order Ambisonics and dynamic binaural synthesis. The SSR is free software licensed under the GNU General Public License. It uses the JACK audio framework and is currently available for...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • EURASIP J. Adv. Sig. Proc.

دوره 2008  شماره 

صفحات  -

تاریخ انتشار 2008